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Abstract: In wireless sensor and actor networks (WSAN), the 
sensor nodes have a limitation on lifetime as they are equipped 
with non-chargeable batteries. The failure probability of the 
sensor node is influenced by factors like electrical dynamism, 
hardware disasters, communication inaccuracy and undesired 
environment situations, etc. Thus, fault tolerant is a very 
important and critical factor in such networks. Fault tolerance 
also ensures that a system is available for use without any 
interruption in the presence of faults. In this paper an 
improved fault tolerance scheme is proposed to find the 
probability of correctly identifying a faulty node for three 
different types of faults based on normal bias. The nodes fault 
status is declared based on its confidence score that depends 
on the threshold valve. The aim is to find the Correct 
Recognition Rate (CRR) and the False Fear Rate (FFR) with 
respect to the different error probability (p e ) introduced. The 
techniques, neighboring nodes, fault calculations, range and 
CRR for existing algorithm and proposed algorithm is also 
presented. 

Keywords: Wireless Sensor and Actor Networks, Fault 
Tolerance, Normal Bias, Failure Probability, Correct 
Recognition Rate, False Fear Rate. 

I. Introduction 

Wireless sensor networks (WSN) consist of spatially 
distributed sensor nodes that can communicate with each 
other, the desired physical quantity such as temperature, 
sound, tremor, force, motion or noxious waste are measured 
by the sensors. The Sensors are devices with low cost and 
powered having limited energy for computation and wireless 
communication capabilities. WSNs are advantageous 
because the end devices are compact and rugged, requiring 
little power to operate and can be deployed for reliable, long 
term, remote operations. WSNs enabled numerous advanced 
monitoring and control application in environmental, 
biomedical, military and other applications. Some of the most 
common WSN applications include environmental analysis, 
physical health checking and device watching. The 
Communication in sensor networks is unpredictable and 
failure-prone due to which the network is said to be vulnerable. 
When the network is supporting more and more applications 
and services, the impact of failure of a network will be more 
obvious, so improving networks fault tolerance is practical 
and important. 

Data delivery in sensor networks is inherently faulty and 
unpredictable. Failures in wireless sensor networks can occur 



for various reasons. First, sensor nodes are brittle, and they 
may fail due to exhaustion of batteries or damage by an 
external event. In addition, nodes may seize and communicate 
wrong readings because of environmental influence on their 
sensing components. Secondly, as in wireless ad hoc 
networks, the links are prone to failure, triggering network 
partitions and initiate active changes in network topology. 
The external object or environmental conditions may fail the 
links permanently or block temporarily. Packets may be ruined 
due to the flawed nature of communication. Third, congestion 
triggers packet loss, it is mainly because a number of nodes 
coinciding transitions from a power saving state to an active 
transmission state in response to an event-of-interest [1]. 

Conventional networks are not concerned with energy 
consumption due to the fact that wired networks are con- 
tinuously powered and wireless ad hoc devices are recharged 
repeatedly. The protocols for these networks are aimed at 
achieving point-to-point reliability, whereas wireless sensor 
networks are concerned with reliable event detection. In sen- 
sor networks, node failure occur more frequently than wired, 
where servers, routers and client machines are assumed to 
operate normally, significant overhead is desired. The proto- 
cols in fixed network bank on handy MAC layer protocols 
that avoid packet collisions, hidden terminal problem and 
channel errors by using physical carrier sense (RTS/CTS) 
and virtual carrier sense (monitoring the channel); in wire- 
less sensor networks. The protocols at the MAC layer have 
to meet additional challenges, such as synchronizing a node's 
sleeping and wake times, and it can only diminish the packet 
collision problem, but cannot give an absolutely solution. 
These remarks directs that new fault tolerant protocols are 
necessary for sensor applications to operate successfully 
and that these protocols should ensure reliable data delivery 
while minimizing energy consumption. 

In customary distributed systems the nomenclature of 
different fault tolerant techniques used is [2]. Fault 
prevention: this is to avoid or prevent faults.2.Fault detection: 
this is to use different metrics to collect symptoms of possible 
faults. 3. Fault isolation: this is to correlate different types of 
fault indications (Alarms) received and suggest different fault 
hypotheses.4.Fault identification: this is to test each of the 
proposed guesses in order to exactly localize and recognize 
faults.5. Fault recovery: this is to treat faults, i.e., reverse their 
adverse effects. There are techniques that address a 
combination of all these aspects. These techniques work at 
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diverse layers of the network protocol stack. Major fault 
avoidance procedures operate in the network layer, 
accumulation redundancy in forwarding paths; a majority of 
fault detection and recovery methods run at the transport 
layer; and a few fault recovery practices execute at the 
application layer, masking faults for the duration of off-line 
data processing. In WSNs the communication between 
sensors is less reliable, i.e. messages may be lost due to 
noise. Fault tolerance also ensures that a system is available 
for use without any interruption in the presence of faults; 
thus fault tolerance increases the reliability, availability and 
consequently dependability of the system. 
Contribution: - In this paper FTNB is proposed. The paper 
addresses three different classes of faults that are the preset, 
balanced and unsymmetrical faults. The faulty nodes under 
these categories are identified effectively based on the 
normal bias. 

Organization: - The paper is organized as follows. In section 
1 introduction to fault management for wireless sensor 
networks is discussed. The literatures of various exiting 
schemes to handle fault tolerance are elaborated in section 2. 
In section 3 the proposed method for fault tolerance based 
on FTNB is presented. In section 4 the problem with algorithm 
and in section 5 results are discussed. The finishing remarks 
are specified in Section 6. 

II. Literature Survey 

The problem of positioning or fixing a sensor network to 
guarantee a definite level of multipath connectivity between 
all nodes is discussed in [3].To handle node catastrophes, a 
greedy and distributed algorithm that produces high-quality 
placements of additional sensors based on absolute minimum 
is considered. The optimal redundancy level that could satisfy 
QOS requirements while increasing the lifetime of the WSN 
is discussed by Ing-Ray Chen et al., [4]. Lifen Yuan et al., [5] 
used Neural Network for classifying faulty nodes; they 
concentrate on pre-processing functions of neural networks 
and used entropy and kurtosis as the two feature parameters 
to identify the faulty nodes. Hong Qiao et al., [6] proposed a 
novel method called a reference model approach to analyze 
the stability of dynamic network. Depending on whether the 
neural states or local field states are taken as basic variables, 
a dynamical neural network is divided into static neural 
network model and local field neural network model. In [7] 
fault tolerance using modified hausdroff distance method that 
determines the correct data range to be detected by the sensor 
nodes is discussed, the actor node makes use of the data 
range to identify a faulty node. 

In [8] a group of tasks are assigned to the node pairs and 
the outcomes are compared by neighboring nodes. They use 
generalized comparison model (GCM) with neural network. 
The gatherings of all agreements and disagreements are used 
to identify the set of permanently faulty nodes. Kuo-Feng et 
al., [9] has proposed an approach where the data 
inconsistency failure is dealt by forwarding the data through 
multiple paths. To identify the suspicious node in the faulty 
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path, a sink node sends diffusion data packets back to the 
source node through both correct and faulty paths. Staddon 
et al., [10] proposed two approaches of resuming the network 
routing paths from the silent nodes (i.e., failed nodes) that 
are detected in each network routing update epoch. The 
particle swarm optimization algorithm to deal with fault 
coverage and also ensure network connectivity is discussed 
by Peng Li et al. , [ 1 1 ] . , According to this algorithm the nodes 
only in the fault region Participate in self-organized movement 
hence saving energy, and increase network lifetime. Duk-Jin 
Kim and B. Prabhakaran., [12] has proposed fault node 
detection algorithms for BSNs that work with different types 
of features extracted from a sensor node data based on node 
history and without node history. Stefano et al., [13] 
considered a solution to recover data loss after a node failure 
by duplicating and distributing redundant information of 
sensed data among other nodes in advance. Koushanfar et 
al., [14] suggested a heterogeneous back-up scheme for 
tolerating and healing the hardware malfunctions of a sensor 
node. The key idea is to adapt application algorithms and/or 
operating systems to match the available hardware and the 
requirements of the application. The online based distributed 
fault detection algorithm [15] for multiple sensors averages 
the history of samples reducing the detection delay for a 
fixed false alarm. Cham Virmani and Khushboo Garg [16] has 
discussed various existing adaptive fault tolerant algorithms 
and compared each of their effectiveness. In [17] dual- 
weighted trust evaluation scheme to detect malicious and 
malfunctioning node for hierarchical sensor network is 
proposed. Trust values of sensor nodes are used as weights 
at the forwarding node to reflect the event. 

A. Background 

The fault tolerance using quartile method where correct 
sensor nodes are detected based on data discreteness has 
been proposed by Chiu-Ching Tuan et al., [18]. This method 
is applied for small network with automated structure having 
variable range and the correct recognition rate achieved is 
93.9%.Chen J.R et al., [19] and Peng Jiang [20] have discussed 
distributed fault detection and improved distributed fault 
detection scheme for detecting faulty nodes. This method 
applies to semi-automated architecture with fixed range and 
small network resulting in a recognition rate of 94%. 

III. Proposed Methodlogy 

In this section fault tolerance in wireless sensor and ac- 
tor networks is built based on Normal Bias Method (FTNB) 
as shown in the Figure 1 . 

A. Network Initialization 

Nine Hundred sensor nodes are randomly deployed in a 
30*30 m area. All the nodes in the network have the same 
transmission range of e3m. The Nodes with faulty sensors 
and permanent communication faults (including lack of power) 
are to be identified, and to be removed or isolated from the 
network. Sensor nodes which generate incorrect sensing data 
or fail in communication intermittently are treated as 
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Fig. 1. Flow Chart of FTNB 

operational nodes, and thus are diagnosed as intermittently 
faulty, i.e nodes that misbehave occasionally and can be 
considered for communication. The node deployment should 
be such that at least one or more nodes are in the 
communication range of the source node as shown in figure 
2; else the network should be redeployed till this condition is 
achieved. 




Nodes within 
Communication Range 
considered for 




Fig. 2: Nodes within Range are listed for Communication 

The placements of nodes are dense and due to this the 
spatial correlations are satisfied. For each fault-free sensor 
node its neighbouring fault-free sensor nodes have similar 
sensing values. Let v-. and v- be neighbours of each other 
and x-. denote the sensed data at node v-.. Then the condition 
to be satisfied by v- . and v - is \x . - x - \ < = A where 'A ' may 
vary depending on the applications. For example if the 
monitored parameter is temperature then two neighbour 
sensors should exhibit similar valve where in the difference 
i.e 'A' will be very small approximately 0.3. 

B. Classify Faults 

Fault may occur at different levels of WSN, such as 
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physical layer, hardware, system software, and middleware. 
As sensors are most prone to malfunction, the focus is on 
the fault-free sensors, i.e nodes are able to communicate and 
process when their sensors are faulty. Taking account of 
sensor measurements, the following sensor fault models are 
considered: (i) Preset fault, a fault sensor constantly report a 
preset valve, (ii) Balance fault, the measured data of a faulty 
sensor is manifested as a calibration offset to the right value, 
(iii) Unsystematic fault, the measured data of a faulty sensor 
is affected by a zero-mean noise with high variance. 

C. Fault Tolerance by Normal Bias Method 

Any sensors v. is detecting or sensing valve x. .The node 
v. has a set of 'N' neighbors that also sense valve given by 
x.,. xn and their corresponding bias are X xn .The 



j 0=1,2, 3... N) 
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normal bias can now be calculated using the following 
equation (1). 

A reward function shown in equation (2) is defined based 
on the sensed and normalized valve. The reward function 
then decides the confidence score. Initially the confidence 
score is set to a maximum valve then it is decremented in each 
round. If the confidence score is reduced to zero the node is 
declared faulty. 

nXiXiMt I,... else where ^ 

In order to measure the correctness of the detected faults 
two parameter alpha and beta are defined. Where alpha 
represents the faulty nodes and beta represents the fault free 
nodes detected as faulty. Using these parameters the 
performance of the algorithm in terms of correct recognition 
rate and the false fear rate are measured. 

IV. Algorithm 

Problem Definition: The FTNB is used to categorize the 
type of fault and also detect the faulty node. The actor node 
based on the received valve performs appropriate action by 
either discarding or accepting data from an intended node. 
The objective is to detect nodes that read faulty valves by 
collecting the neighbour node valves and also identified the 
fault class of the node. The fault reading nodes are isolated 
from transmitting the faulty valves to the actor temporarily or 
permanently based on the type of fault class. The 
implementation steps of FTNB algorithm is discussed in the 
Table I. 

V. Results And Discussion 

The performance of the proposed method is evaluated by 
MATLAB version 7.7b R2008. For simulation it is assumed 
that faults are independent of each other. Nine Hundred sensor 
nodes are deployed randomly in a network area of 30*30m. 
The nodes only within the transmission range of "3 
participate in communication. The neighbour nodes list for a 
source is generated based on the normal bias and the 
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Table I: Proposed FTNB Algorithm 



Table II: Comparison of different types of Faults 



Data: 



Result: 



A list of node array within range, num2str (i) {I varying 
from 1: N] 



An estimate of correct recognition rate and false fear 
rate. 



Begin: Count=0 

{For i= 1:1: p (time instance for each node) 
For j=l: N 

Xi= Reading from Sensor 
Is ^Randomly generated Neighbour list 
Random list of faulty nodes=list 
For K=l: length (Is) 
Set initial confidence score=l 
Find Xi 

If f (x i; Xi)>threshold 
Decrement confidence score 
If f (x^ XO ^threshold 
Return (faulted node) } 
For i=l: size (faulted node, 1) 

kk=result of the comparison between (faulted node, list) 
If is empty(kk) 
Returns (0) 

Faulty free detected faulty = Faulty free detected faulty +1 
Else 

Returns (1) 

Faulty node ^Faulty node+1 
End 

CRR= Faulty node / (size (list, 1)) 

FFR= Fault free detected as faulty / (node-size(list,l)) 



confidence score set. The performance parameters such as 
Correct Recognition Rate (CRR) and False Fear Rate (FFR) 
are used to evaluate the algorithm. 

(i)Correct Recognition Rate: It is defined as the ratio of the 
number of faulty sensor nodes detected to the total number 
of faulty nodes as shown in Equation 3. 



CRR = 



(3) 



Where N is the number of nodes deployed and p e is the error 
probability. Typically the value of CRR should be 1 but, as 
probability increases CRR drops. 

(ii)False Fear Rate: It is defined as the of the number of fault- 
free sensor nodes diagnosed as faulty to the total number of 
fault-free nodes as shown in Equation 4. 



FRR - 



♦V& . of a ctual / Vj ! f frtt Smor wdmX > { 1 - 1' . ) "« 

(4) 



Ideally the value of FFR should be but, as error 
probability increases FFR increases. The error probability to 
the network is varied from 0.05 to 0.3 . -If theerror is increased 
beyond 0.3 then the CRR decreases and FFR increases that 
will drop the performance of the algorithm below 90%. 

Table II shows the CRR and FFR for three categories of 
faults for a failure ratio of 0.2. The performance of the algorithm 
depends on the number of sensors nodes deployed in the 
event area, the failure ratio introduced the defined threshold 
and the type of fault as the each fault is independent of the 
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other. For each fault condition and a failure probability the 
node reads for 20 time instances. The source node and its 
neighbour list is short listed. The percentages of nodes that 
are fault free and faulty are calculated depending on number 
of nodes deployed and node failure introduced to calculate 
FFR and CRR. 

Figures 3, 4 and 5 shows the results of CRR and FFR 
(figure of merit) for different error probabilities valves 
introduced. It is observed from the graph that network can 
sustain faults up to 0.2 error probability and show Correct 
Recognition Rate of above 95% and maintaining the False 
Fear Rate below 0.05 for the preset and balanced fault while 
below 0. 1 for unsystematic fault due its random nature. The 
different failure probability levels help in handling different 
type of faults like preset, balanced and unsystematic faults. 
To reduce the size of the exchange message a neighbor list is 
prepared, by this node location and every sensor reading 
message size is avoided ie., only those nodes in the neighbor 
list will be eligible for communications is reduced that in turn 
reduces the energy spent by the node. Here the nodes 
communicate only with nodes in its fault tolerance range. 
The message exchange is between the node and neighbor 
nodes and this does not require routing protocol there by 
contributing to network lifetime by minimal communication. 

A. Pre set fault 



1-7- 

D.9 
D.fl 
□.7 
D.6 
S Z = 
D.4 
D.S 
□.2 
□.1 



Node Ferfom ance under different error prob ability 

— ¥ f ==S=Z 




O.I 5 0.2 
Nodes failure Probability 



Fig. 3: Variation of CRR and FFR (Preset Fault) with Error 
Probability 
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B. Balance fault 
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Fig. 4: Variation of CRR and FFR (Balance fault) with Error 
Probability 

C. Unsystematic fault 
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Fig. 5: Variation of CRR and FFR (Unsystematic fault) with Error 
Probability 
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Fig. 6: Variation of CRR and FFR (Preset Fault) with the Number of 
Nodes 
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Fig. 7: Variation of CRR and FFR (Balanced Fault) with the Number 
of Nodes 
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Variation of CRR and FFR (Unsystematic Fault) with the 
Number of Nodes 
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Figure 6, 7 and 8 shows the variation of CRR and FFR for 
varying number of node deployment keeping the 
communication range constant at "3 for all the three types of 
faults. It is observed that the algorithm performs better for 
lower error probability. The FFR increases as error probability 
increases as more number of sensors misbehaves. 

Figure 9, 10, and 1 1 show the variation of CRR and FFR 
keeping the number of nodes constant ie., for node 
deployment valve of 900 nodes and varying the range of 
communication for the all three category of faults. It is 
observed that the results are best obtained from the algorithm 
for a node deployment of 900 when the range is "3 and the 
error is 0.05. The performance of the algorithm drops with 
error of 0.3, as the number of fault free sensors detected as 
faulty increases. 
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Variation of CRR and FFR with Range 
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Fig. 9 : Variation of CRR and FFR (Preset Fault) with Range 
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Fig. 10: Variation of CRR and FFR (Balance fault) with Range 
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Fig. 12: Plot of No of Actual Faulty Nodes Vs No faulty Nodes 

detected 

Figure 1 2 shows the number of faulty nodes introduced 
to the number of faulty nodes detected. It is observed the 
number of faulty nodes detected is almost equal to the number 
of faulty nodes introduced. Hence our algorithm is better. 

Plot of false fear with error introduced 
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Fig.13: Plot of No of Fault Free Nodes detected faulty 

Figure 13 shows the plot of number of fault free sensors 
detected as faulty. It is observed that the number of fault free 
sensors detected as faulty is very low up to error probability 
of 0. 15 and the number of fault free sensors detected as faulty 
increases very much as the error probability is increased from 
0.15to0.3. 

The techniques, fault calculations, neighbouring nodes, 
range and CRR for existing algorithm and proposed algorithm 
are tabulated in Table 3. The Chiu Ching Tuan et al.,[18] used 
AFTM technique for fault calculation and CRR. Chen et al. , [ 1 9] 
proposed DFD technique with fixed range of communication 
to compute fault calculation and CRR. Peng Jiang [20] 
discussed improved DFD technique for fixed range of 
communication and for smaller network size to compute CRR. 
It is observed that the percentage CRR is 97% in the proposed 
algorithm compared to the less percentage CRR valves in the 
existing algorithms. Hence the proposed algorithm is better 
in Correct Recognition Rate compared to existing algorithm. 
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Table III: Comparison of Proposed with Existing Algorithms 



Methods 


Techniques/ 
Architecture 


Fault Calculation 


Neighbour 

Nodes 
Involved 


Network 
size 


Node Involved 
in Fault 
Calculation 


Range 


CRR 


Chiu-Ching Tuan et al ., [18] 


AFTM/ Automated 


Averaging and comparing 
with allowable threshold 


No 


Small 


Actor Node 


Variable 


93.9% 


Chen etal., [19] 


DFD/Semi- 
Automated 


Local comparisons are 
made and supported by 
majority voting 


Yes 


Small 


Source Node 


Fixed 


83% 


Peng Jiang [20] 


Improved 
Automated 


Local comparisons are 
iiictuc cuiu aiijjjjui icu uy 
majority voting. 


Yes 


Small 


Source Node 


Fixed 


94% 


Ozakietal., [21] 


FTQM/ Automated 


Data is sorted and then 
divided into four 
regions(Quartile) 


No 


Small 


Actor Node 


Variable 


80.1% 


Proposed FTNB 


Proposed FTNB 
/S emi- Automated 


Normal Bias of the 
neighbor nodes defines 
the confidence score. 


Yes 


large 


Source Node 


Fixed 


97% 



VI. Conclusion 

In WS AN it becomes very important to cater to sensing 
faults and also to distinguish between various kinds of faults 
and build a reliable fault tolerant strategy suitable to its class. 
In this paper FTNB algorithm for fault tolerance is proposed. 
The nodes should have at least one neighbour node so that 
the detected valve can be compared or be in the region of 
valves emitted from all the neighbour nodes. Based on the 
received normal the confidence score of the node is varied. 
This decides the correctness of the node. The proposed 
method has a low implementation complexity and gives bet- 
ter results in terms Correct Recognition Rate and False Fear 
Rate for varying error probabilities. In future FTNB can be 
integrated with communication link fault tolerance mecha- 
nism. 
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